Sound Recognition in Mixtures
نویسندگان
چکیده
In this paper, we describe a method for recognizing sound sources in a mixture. While many audio-based content analysis methods focus on detecting or classifying target sounds in a discriminative manner, we approach this as a regression problem, in which we estimate the relative proportions of sound sources in the given mixture. Using certain source separation ideas, we directly estimate these proportions from the mixture without actually separating the sources. We also introduce a method for learning a transition matrix to temporally constrain the problem. We demonstrate the proposed method on a mixture of five classes of sounds and show that it is quite effective in correctly estimating the relative proportions of the sounds in the mixture.
منابع مشابه
Effect of sound classification by neural networks in the recognition of human hearing
In this paper, we focus on two basic issues: (a) the classification of sound by neural networks based on frequency and sound intensity parameters (b) evaluating the health of different human ears as compared to of those a healthy person. Sound classification by a specific feed forward neural network with two inputs as frequency and sound intensity and two hidden layers is proposed. This process...
متن کاملDifferent Profiles of Verbal and Nonverbal Auditory Impairment in Cortical and Subcortical Lesions
A B S T R A C T Introduction:We investigated differential role of cortical and subcortical regions in verbal and non-verbal sound processing in ten patients who were native speakers of Persian with unilateral cortical and/or unilateral and bilateral subcortical lesions and 40 normal speakers as control subjects. Methods: The verbal tasks included monosyllabic, disyllabic dichotic and diotic tas...
متن کاملLearning Musical Instruments from Mixtures of Audio with Weak Labels
We are interested in developing a system that learns to recognize individual sound sources in an auditory scene where multiple sources may be occurring simultaneously. We focus here on sound source recognition in music audio mixtures. Many researchers have made progress by using isolated training examples or very strongly labeled training data. We consider an alternative approach: the learner i...
متن کاملPrediction-driven Computational Auditory Scene Analysis for Dense Sound Mixtures
We interpret the sound reaching our ears as the combined effect of independent, sound-producing entities in the external world; hearing would have limited usefulness if were defeated by overlapping sounds. Computer systems that are to interpret real-world sounds – for speech recognition or for multimedia indexing – must similarly interpret complex mixtures. However, existing functional models o...
متن کاملAnalysis of motor fan radiated sound and vibration waveform by automatic pattern recognition technique using “Mahalanobis distance”
In recent years, as the weight of IT equipment has been reduced, the demand for motor fans for cooling the interior of electronic equipment is on the rise. Sensory test technique by inspectors is the mainstream for quality inspection of motor fans in the field. This sensory test requires a lot of experience to accurately diagnose differences in subtle sounds (sound pressures) of the fans, and t...
متن کاملNSF-CAREER: The Listening Machine IIS-0238301 2003–2008 Final Report
This six-year project started with the idea of applying sound recognition and separation techniques that had originated in speech recognition to a broader domain of environmental sound mixtures. As it proceeded, the work diversified into several distinct areas, reflecting the different directions of the graduate students primarily supported by the project: Manuel Reyes and Keansub Lee worked on...
متن کامل